RECONSTRUCTION FOR COLORINGS ON TREES 

NAYANTARA BHATNAGAR*, JUAN VERAt, ERIC VIGODA*, AND DROR WEITZ § 

Abstract. Consider fc-colorings of the complete tree of depth i and branching factor A. If we 

fix the coloring of the leaves, for what range of k is the root uniformly distributed over all fc colors 

(in the limit i — > oo)? This corresponds to the threshold for uniqueness of the infinite- volume Gibbs 

measure. It is straightforward to show the existence of colorings of the leaves which "freeze" the 

entire tree when fc < A + 1. For fc > A + 2, Jonasson proved the root is "unbiased" for any fixed 

coloring of the leaves and thus the Gibbs measure is unique. What happens for a typical coloring 

of the leaves? When the leaves have a non-vanishing influence on the root in expectation, over 

random colorings of the leaves, reconstruction is said to hold. Non-reconstruction is equivalent to 

^ ^ extremality of the free-boundary Gibbs measure. When k < A/ In A, it is straightforward to show 

Q , that reconstruction is possible (and hence the measure is not extremal). 

\—^ ■ We prove that for C > 1 and k = CA/lnA, non-reconstruction holds, i.e., the Gibbs measure 

^H ' is extremal. We prove a strong form of extremality: with high probability over the colorings of 

/»^«^ ' the leaves the influence at the root decays exponentially fast with the depth of the tree. Closely 
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related results were also proven recently by Sly. The above strong form of extremality implies that 
a local Markov chain that updates constant sized blocks has inverse linear entropy constant and 
hence 0(N log N) mixing time where N is the number of vertices of the tree. Extremality on trees 
and random graphs has received considerable attention recently since it may have connections to the 
efficiency of local algorithms. 
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1. Introduction. Reconstruction on trees arises naturally in a variety of con- 
rf-\ , texts. In evolutionary biology it is intimately related with the efficiency of inferring 

^ ' phylogenetic ancestors [28l E] . It appears in communication theory in the study of 

'nJ" , noisy computation [H]. Most closely related to the origins of this work, for spin 

systems from statistical physics, the threshold for reconstruction is equivalent to 
the threshold for extremality of the infinite-volume Gibbs measure induced by free- 
boundary conditions (see, e.g., [151), and is closely connected to mixing properties 
of the single-site update Markov chain known as the Glauber dynamics [H [21]. In 
fact, Berger et al [1] showed that 0(A'^logA'') mixing time of the Glauber dynamics 
f^ , implies extremality. Recent work of Krzakala et al 'W\ used statistical physics meth- 

ods to study extremality for random colorings and random /c-SAT, and suggests there 
are interesting connections between thresholds for extremality and efficiency of local 
algorithms. 



^ , Our general setup is a complete tree with branching factor A and depth £, which 

we will denote as Ti = (V^, Ei). (To clarify, the maximum degree of this tree is A-|- 1.) 
A fc-coloring is an assignment a : Vi —> [k] = {!,..., A:} where {v,w) G Ei implies 
(7{v) y^ a(w). Let fii denote the uniform measure over fe-colorings of the tree. Finally, 
we denote the leaves of the tree Te by Lg. When the height of the tree is clear from 
the context, we will drop the subscript and just use L. 

In the reconstruction problem, we are asking whether a typical coloring of the 
leaves influences the conditional measure at the root. In words, we choose a random 
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coloring of the tree, fix the induced coloring for the leaves, and then choose a random 
coloring of the internal vertices consistent with the leaves. If the root has some non- 
vanishing bias to any color then we say reconstruction is possible. Here is a precise 
definition. 

Definition 1.1. The reconstruction problem for Ti is solvable if there exists 
c G [k] such that 



hm E^^ 



/if (cr(root) — c I a{L) = t{L)'^ — 1/k 



>0. 



Non-reconstruction is equivalent to extremality of the corresponding Gibbs measure 
of the infinite tree (i.e., it can not be expressed as a convex combination of other 
Gibbs measures), see, e.g., [T5] . 

In the corresponding reconstruction problem for the Ising model, the reconstruc- 
tion threshold has been precisely established [B] . Recent works have improved bounds 
on the reconstruction threshold for the case of the hard-core model on weighted in- 
dependent sets HH m]. More recently, Borgs et al [S] showed that for binary 
asymmetric channels, the Kesten-Stigum eigenvalue bound is tight for reconstruction 
non-solvability in the case of limited asymmetry. 

The problem of determining the reconstruction threshold for colorings on the tree 
was discussed in [71 [351 121] ■ The threshold for uniqueness of the Gibbs measure on 
trees is now well-known. In contrast to extremality, uniqueness of the Gibbs measure 
requires that for every fixed coloring of the leaves, the root is not biased to any color. 
Jonasson [19] proved that for fc > A + 2 the Gibbs measure is unique [1^, whereas 
for A; = A -I- 1 there are "frozen" boundary conditions in the sense that the root has 
only one possible color, see [7] for more on the topic of uniqueness for colorings. 

It was known [30] that if fc < (1 — e)A/lnA for any e > then reconstruction 
is solvable, and more precisely as established in [33] . reconstruction is solvable for 
A > fc(lnfc + \nlnk + 1 + o(l)). Recent works [201 131] used the so-called replica- 
symmetry breaking method to conjecture that the reconstruction threshold for k- 
colorings of the tree T^ is at A = k{\nk + In In A; -I- 1 -I- o(l)). From an algorithmic 
perspective it is interesting that these bounds on the degree are close to the current 
best results for efficiently coloring Erdos-Renyi random graphs G'(n, A/n) Ij. 

In this work we prove that the reconstruction threshold is at [^^(l + o(l))- In 
particular, we show that for C > 1 and k > CA/lnA reconstruction is not possible, 
with the bias of an average boundary condition decaying exponentially fast with the 
height of the tree. 

Theorem 1.2. Let C > I and e = min{C — 1,1/3}. There are constants 
a{e) > 0, Aq = Ao(e), and £o — io{e) such that for all A > Aq, every k > CA/lnA, 
and every color c £ [k], for £ > £o, 



Et-^ 



/i£(cr(root) = c I a{L) = t{L)) - 1/fc 



< A" 



We also prove a high concentration result, where a much stronger tail bound (on the 
fraction of biasing boundary conditions) is established. 

Theorem 1.3. Let C > \ and e = min{C — 1,1/3}. There are constants 
a{e),a'{e) > 0, Aq = ^0(2) and £0 — £o{e) such that for all A > Ao, every k > 
CA/lnA, and every color c e [k], for £ > £0, 



Pr. 



fj,i{a{root) = c I cr{L) = t{L)) - \/k 



> A- 



<e-^° 



Remark 1.4 (A Note on Comparison with Previous Work). An earlier version of 
this paper proved the above results for C > 2, which was the first work giving the right 
order for the threshold for non-reconstruction fj'- Stated in terms to make comparison 
with other work easier, it was shown there that non-reconstruction holds when A < 
2 fc In k~o{k In k) . The bound was sharpened by Sly in 134% using independent methods, 
and showing that non- reconstruction holds when A < /c[ln /c + In In fc + 1 — In 2 — o(l)] . 
Subsequent to this the authors of this paper refined the approach of /5/ to obtain the 
current results, showing non-reconstruction when A < fclnfc — o(fclnfc). Though the 
bound on the reconstruction threshold in J3^ is more precise, we feel our results are 
still of independent interest for the following reasons. 

The techniques used in the current work (an extension of those appearing in \3I, 
utilizing ideas from \23f ) are independent and different from the analytical approach 
of 134^ . There may be some insight to be gained from this proof approach. Secondly, 
as a consequence of the strong concentration result stated above, using results of Mar- 
tinelli et al '2^ \24h *^ follows that a local Markov chain (in each step the colors of 
a constant number of vertices are updated) has inverse linear entropy constant and 
hence 0{N log N) mixing time. 

We state the relevant definitions before formally stating the result for the dynam- 
ics. We consider dynamics on the tree T„, and analyze its performance as a function 
of the volume of the tree N = \Tn\. In each step the dynamics updates the colors 
of a block of vertices. Specifically, for a fixed parameter £, for every w S T„ let By^g 
be the subtree (of r„) of depth £ rooted at v, i.e., By^^ includes all vertices in T„ 
that are descendants of v (including v) and whose distance from v is at most £. We 
analyze the dynamics Mi, which makes heat-bath updates in a random block By^i. 
Specifically, let the state space fl be the set of proper /c-colorings of T„. From Xt ^ ft, 
the transition Xt -^ Xt+i is defined as follows: 

• Choose a vertex v uniformly at random from T!„. 

• For all w ^ Byj, set Xf+i(w) — Xt{w). 

• Choose Xt+i{By^() uniformly at random from the set of fc-colorings consistent 
withXt+iiTn\k,i)- 

The case when € = is the well-studied single-site dynamics known as the Glauber 
dynamics. It is easy to see that for fc > 4 and for all £, Aie is an ergodic Markov 
chain with unique stationary distribution tt uniformly distributed over f2. The central 
question is the mixing time defined as: 

T,„i, = maxminji : ||P*(Xo, •) - tt\\ < l/2e} 

where P*(Xo, •) is the distribution after t steps of the dynamics starting from coloring 
Xq and II • II denotes variation distance. The choice of the constant l/2e implies that 
variation distance < e can be achieved after < [In l/£]Tmix steps [2]. 

We bound the mixing time via the entropy constant. Let / : fi — > R be an 
arbitrary test function. The entropy of / is 

Ent(/) :- Ent,(/) - E,[/ln(/)] - E,(/) ln[E,(/)], 

and the entropy constant of the Markov chain is defined as 

. ^ Ent(/) - Ent(P/) 
/>o Ent(/) 



where the infimum is over non-constant functions /. The entropy constant, which 
bounds the rate of decay of entropy, provides a good bound on the mixing time. In 
particular, standard results (see, e.g., [TUl ISl ISZ] ) imply: 

Tmi. < 0[a^' loglog(7r-ij] < 0{a^' log TV). 

We prove the following result about the dynamics. 

Theorem 1.5. Let C > I and e = min{C — 1,1/3}. There are constants 
Ao = Ao(£), and i = i{e) such that for all A > Aq, every k > CA/ln A, and all n, 
the entropy constant of the dynamics Me on r„ satisfies 

1 

Or > 

47V 
and consequently the mixing time satisfies 

T^i^ = OiN\ogN). 



It is an interesting open problem to prove 0{N log A'^) mixing time of the Glauber 
dynamics for the same range of colors as the above theorem. The best known results 
for the mixing time of the Glauber dynamics on the tree are Martinelli et al 24J who 
proved 0{N log N) mixing time when A; > A + 3 for arbitrary boundary conditions, 
and Hayes et al pjj proved polynomial in N mixing time (specifically, 0*{N'^) mixing 
time) when k > lOOA/ log A for any planar graph. For complete trees, Goldberg et al 
[17j and Lucier et al [21j recently showed the mixing time is polynomial for any fixed 
k and A. 

Finally, a related problem is the reconstruction threshold on Erdos-Rcnyi ran- 
dom graphs G{n,p) with p — A/n, so-called sparse random graphs. Recent work of 
Gerschenfeld and Montanari |T6j gives sufficient conditions under which extremality 
on the tree is equivalent to extremality for sparse random graphs. They showed that 
the conditions are satisfied for the g-state Potts model at all temperatures except 
zero temperature, which is the case of proper colorings. Subsequently in [26., the 
conditions were also verified in the case of proper colorings. 

2. Proof outline and outline of paper. Our proof is divided into two major 
parts. In the first part, which is contained in Section[3l we prove that with very high 
probability over the colorings of the leaves, the root is not too highly biased in favor 
of any color. Roughly, with probability > 1 — e^ over random colorings X of the 

leaves, for every color c. 



fie (cr(root) = c | aiL) = X) < A"^/^ 

where e is as defined in Theorem 11.31 We prove this statement by analyzing the 
same recurrences as used by Jonasson [12] in his proof for uniqueness on the tree 
when A: > A -|- 2. His recurrences express the marginal distribution at the root of a 
tree of size £ in terms of the marginals for trees of size £ — 1. The difficulty in our 
setting is that when fc < A + 2 it is unclear if the recurrences converge to the uniform 
distribution as a fixed point. 

In the second part of the proof (which appears in Section [4|) we use a two stage 
coupling similar to that used in [23] together with the bound on the maximum prob- 
ability of a color from the first part to establish the decay of correlation stated in 



Theorem 11.21 We then add arguments (taken from [53] ) to get the stronger Theo- 
rem [O] 

Finally, in Section [5] we prove Theorem 11.51 establishing fast convergence of the 
block dynamics for constant sized blocks. 

3. Unbiasing leaf colorings. In this section we show that for most colorings 
of the leaves X, at all vertices far enough from the leaves, the color conditioned on X 
has "sufhcient" randomness. 

We call a vector X G {•, 1, . . . , k} a (partial) coloring of the leaves Lh- The 
notation X^ = 7k- is used to denote that the i-th leaf is not assigned a color. We say 
that X is allowed if there exists a coloring a of the tree Th which is consistent with 
X, i.e., for every leaf z, if X{z) £ {1, . . . , fc} then a{z) = X{z). 

Given an allowed coloring X of Lh and c G [k] we define Ph {X, c) inductively by 



Po{X,c) = { 



1 ifX = c \{{l-Ph-i{X,,c)) 

l/k ifX=* PhiX,c) = ^^ 

oherwise ^ J|(i _ p^_,{X,, d)) 

dG[k] i=l 



where X — (Xi, . . . , Xa) with each Xi G {•, 1, . . . , k}'^ . An inductive argument 
shows that Ph {X, c) is well-defined for all allowed colorings X and all colors c. 

A simple computation shows that for allowed colorings X, Ph{X, c) is, in fact, the 
probability the root is colored c conditioned on X at the leaves. Note that conditioning 
on a partial coloring X simply means conditioning on the vertices which are assigned 
colors in X. 

Lemma 3.1. For all h, and all allowed colorings X of Lh, and all c, 

Hhi<y{root) = c I a{Lh) = X) = Ph{X, c). 



Proof. The proof is by counting the appropriate sets of colorings and induction 
on h. Let flh{X, c) be the number of colorings of a tree of height h where the root is 
colored c and the coloring is consistent with X. Then, 

A 

l[J2^h^^i^^J) 
(I +^ \ (T \ Y\ ^h{X,c) i=l/#c 

^j,h{(J[root) = c\a[Lh) = A) — — 



gn^x,.) j.^^„^__„,^,) 



d£ki=l f^d 



By the assumption that X extends to a coloring oiTh, it must be that each Xi extends 
to a coloring of the tree Th-i hence. Hi X]f ^h-i{Xi, f) / 0. Dividing the numerator 
and denominator by this factor, we obtain 



= n,(A, c). 



nh-i{X,,d) 



A 



gn-sfi^^^ £n-«.-..-^-. 



D 

Henceforth, we assume that C > 1 and k = CA/lnA. Define the parameter 
e = e{C) = min{C - 1, 1/3}. Notice that fc > (1 + e)A/hi A. We also assume Ao(e) 
is a large constant depending only on e. 

We now give a recursive definition of a coloring of the leaves being unbiasing. In 
the base case, let X G {*, 1, . . . , fc}^ be a (partial) coloring of Li. We say that X 
is unbiasing if and only if at least A*^/^ colors do not appear in X. For £ > 1 let 
X = (Xi, . . . , Xa) be a coloring of Lg where Xi is the coloring of the leaves of the 
subtree rooted at the i-th child of the root. We say that X is unbiasing if and only if 
at most A^^^ of the Xi are not unbiasing. 

Indeed, given an unbiasing coloring of the leaves, the color at root cannot be too 
biased. 

Lemma 3.2. For any i > 1, any unbiasing coloring X of Lg, and any color c, 

PfXX,c) < A-"/2. 



The main result in this section is that if X is a random coloring of the leaves L^, 
then X is unbiasing with very high probability. We use X ^ iii{a{L)) to denote that 
X is a random coloring of the leaves of the tree T^, i.e., choose a from /i^ and let 
X^a(L). 

Theorem 3.3. For all £> I, 



Prx~A'«('T(i)) [^ *'' unbiasing] > 1 - e 



For a given (., let h{v) be the distance (or height) of the vertex v from the leaves 
Li, and X^ be the restriction of the coloring X to those leaves that are in the subtree 
rooted at v. We call X highly unbiasing if for every v with h{v) > e£, Xy is unbiasing. 

Corollary 3.4. For all £ > io{e), 



Prx~Mf(<T(L)) [X is highly unbiasing] > 1 



_^el/3 



e 



Proof. This follows from Theorem 13. 31 using a simple union bound once we notice 
that there are 0{A^^~'^'>^) vertices v with h{v) > eL D 

The rest of this section is dedicated to proving Lemma [3?2l and Theorem l3.3l 



3.1. Properties of partial colorings of the leaves. We now show that an 
unbiasing coloring of the leaves indeed implies a not-too-biased color at the root. 

Proof. [Proof of Lemma 13. 2| The proof is by induction on £. Let X be an 
unbiasing coloring of the leaves Lt. We need to show that Pt{X^ c) < /Sr^/"^ for every 
color c G [fc]. For the base case when £ — 1, by definition, there are at least A^/^ colors 
not appearing in X, and hence Pi(X, c) < A~^/^ for every color c. For ^ > 1, write 
X — (Xi, . . . , Xa), where Xi is the coloring of the leaves of the subtree rooted at the 
i-th child of the root. By definition, at most A^^*^ of the Xi are not unbiasing. By 
induction, ii Xi is unbiasing then for all colors c, P^_i(Xi, c) < A~^/^. If, however, Xi 
is not unbiasing then there are at most A"^/^ colors c for which Pi_i{Xi,c) > A~^/^ 
because by Lemma [XT] we have J2de\k]^h-i{Xi,d) = 1. Therefore, there is a set 
G C [fc] of size at least fc - A^-'^A'^/^ = fc - A^^"^^ such that for aU d e G, and for 



every i G [A], P£^i{Xi,d) < A "^Z^. Hence, we have 



l[il-Pe-iiX..,c)) 
Pi{X,c) = ^^i- < 



A - A 

dG[k]i=l d£Gi=l 



Using the arithmetic-geometric mean inequahty, 



1 1 /l4-A-^/2 ^ 

PeiX,c) < - ^°^P iGi EE^^^i(^"^) 

d£Gi=l 

where the second inequahty uses the fact that if a; < 6 < 1/10 then 1 — a; > exp(— (1 + 
S)x), and that A > Ao(£). Using induction and plugging in the lower bound \G\ > 
k — A^"*^/^ > (1 ~ A~^/'^)fc (for A large enough), we obtain 



1 /(l + A-'^/2)A\ A(i+=)(i-^-"/') /o 



Before we go on to bound the probability that a random coloring of the leaves 
is unbiasing (i.e., prove Theorem 13. 3p . we state some basic lemmas which give some 
independence, allowing a recursive solution. 

Lemma 3.5. Let X ^ iie(a{L)). For any < h < i, let wi, . . . ,w^i-h denote 
the vertices which are at depth £ ~ h from the root. Let X — {X^^, . . . , X^ ^_^) 
where X^^ is the coloring of the leaves of the subtree rooted at Wi . Then the X^. are 
identically distributed as ^h{a{Lh))- 

Proof. Consider the following recursive method for constructing a random coloring 
of the leaves: choose the color of the root uniformly at random from k colors and 
independently choose the color of each child from the k — 1 remaining colors. Now 
consider a vertex v at height h from the leaves. Since each color is equally likely to 
appear at w, the distribution of the coloring at the leaves of the subtree rooted at v 
is identical to the distribution over colorings of leaves in a random coloring of T^ . D 

Lemma 3.6. Let X = (-'^i,--- -.X/^) ^ fie{a{L)) where Xi e [k]'^^'^ is X re- 
stricted to the subtree rooted at the i-th child of the root. Let Ui denote the event that 
Xi is unbiasing. The events Ui,! < i < A are independent. 

Proof. The proof follows from the Markovian property of the Gibbs measure (the 
configurations on the subtrees are independent of each other once we condition on a 
spin at the root), and the fact that the event of being unbiasing is symmetric with 
respect to colors. Formally, for / C [A], let Uj = f]^^jUi. It is enough to show that 



if j ^ /, then fiii{Uj\Ui) — ^iiiUj). Now, 

^ll,{Uj \Ui) = ^ ^J,i{cr{root) = c | Ui) ■ fii{Uj \ a{root) = c , Ui) 

ce[k] 

— y. ^'li'^i'root) = c\Ui) ■ ne{Uj \ a (root) — c) 
ce[fc] 

= 2_, fJ-ei^'ifoot) = c) ■ fii{L(j I a{root) — c) 
ce[k] 

where the second equahty follows from the Markovian property of the Gibbs measure 
and the third equality uses the fact that the event Uj is symmetric with respect to 
the color at the root. D 

We are now ready to complete the proof of the main theorem in this section. 

Proof. [Proof of Theorem 13.3] Let q^ = Prx~/ij.(o-(L)) [^ is not unbiasing] . We 
need to show that 

^ _A(i-(l-B))/2 

which we will do by induction on £. For £ > 1 let the random coloring X = 
{Xi, . . . , X^), where Xi is the random coloring of the leaves of the subtree rooted 
at the i-th child of the root. Assuming A > Ao(£) (so that e^'^ < A~^) we have 

qe = Prx^^f (cr(L)) [> A^^^ of the Xi are not unbiasing] (by definition) 

< A'^'"' • {qi_i)'^''" (by Lemmas [Ml and [331) 
<{Ae~^"^''-"''r'-' (by induction) 

< e 

In the base case oi £ = 1 the colors at the leaves are chosen independently (from fc — 1 
colors, say from {1, • • ■ ,k — 1}) once the color at the root is fixed. We have to upper 
bound the probability that there are less than A^/^ unused colors. Let S denote the 
set of unused colors. Let x^.c be a 0-1 valued random variable which is 1 if the color 
c is chosen at the vertex i. Then the number of unused colors is given by 

fc-l A 

By linearity of expectation and independence over the children, we have 

n\s\] - E n(i - E[x,,,]) = fc (i - -1-) > lOA^/^ 

where the last inequality follows by using the fact that for A > Ao(e), A < (1 — 
^)(fc — 1) ln(fc — 1) and, for a; < J, 1 — a; > exp(— (1 + S)x). It is not difficult to verify 
(for example by induction) that for Y^ := J^j(l — Xi,,), the variables {e'^^^}ce[q] have 
negative covariance for any t > 0. Hence, we can apply the usual Chernoff bounds, 
(see, e.g., the proof of Proposition 7 of llj), to show that Pr[|S'| < A"^/^] — qi < 



4. Convergence. In this section we complete the proof of Theorem II .21 i.e., 
establish exponential decay of correlations for the measure. We do that using cou- 
pling arguments where disagreements percolate down and up the tree, along the lines 
suggested in |23j . The main addition in our arguments is using the properties of the 
measure established in the previous section (in particular, Corollarv l3.4p to get a bet- 
ter bound on the probability that a disagreement percolates upwards. (In [23] worst 
boundary conditions were assumed for the upward coupling.) 

We will establish a slightly different, but still equivalent to that stated in Theo- 
rem ll.21 form of the decay of correlations. For every color c, let /z^ ^ be the distribution 
of the coloring of L( conditioned on the color at the root being c. For every coloring 
^i let /x)f I 



X of the leaves Lp let u v i, be the distribution of colors at the root conditioned on X 



iT 



being the coloring of the leaves. Finally, let /i^ ^ be the distribution of colors at the 

root resulting from first choosing X from /i^ g and then choosing a color for the root 

from Hxv Non-reconstruction is equivalent to the dependence of /x^^^ on c vanishing 
with (.. We establish the decay of this dependence in the following theorem. 

Theorem 4.1. Let C > 1. There is a constant a{e) > such that for all A > Aq, 
every k > CA/ In A, and every pair of colors ci, C2 G [k], for i > £q, 

Indeed, it is not too difficult to see that this theorem implies Theorem 11.21 (See 
Section[6]for a proof of the equivalence between the two forms of decay of correlations.) 

We prove Theorem 14.11 bv coupling the two distributions. The coupling consists 
of two steps, the first is coupling the colorings of the leaves (downward coupling) 
conditioned on the disagreement at the root, and the second is coupling the color at 
the root (upward coupling) based on the pair of colorings of the leaves achieved in 
the ffist step. 

We start by bounding the average Hamming distance between the two coupled col- 
orings of the leaves resulting from the ffist step. For two colorings X, Y, let Hi{X, Y) 
stand for the Hamming distance between X{Li) and Y(Li), i.e., the number of leaves 
of the tree of depth £ in which X and Y differ. 

Lemma 4.2. For every £ and any ci,C2 G [k] there is a coupling v — v^ ^ ^ of 

/x^^ ^ and ii^.^ £ such that 



^(x.Y)^. [Hi{X,Y)] = (^^-^j < In^(A). 

Proof. We let I'j: ^ ^ be the following recursive coupling: For £ > I, starting from 
the disagreement (ci,C2) at the root, couple the spins at each of the children of the 
root independently according to the optimal coupling for each child. For each child 
vertex, if the resulting spins at this vertex agree then the whole subtree can be coupled 
to agree. If the spins disagree, say a {c[,C2) disagreement, continue recursively using 
the coupling i^j;, ^, ^_-^, where for the base case the coupling vj:^ ^^ g is trivial. Notice 
that in our setting, the probability of a disagreement percolating to a given child 
isK=l/(fc— 1). To see this notice that the distribution at the child conditioned 
on the color of the root being c is uniform over [k] \ {c}. Thus, conditioned on a 
(ci, C2)-disagreeinent at the root, the two distributions at the child can be coupled 
such that the colors agree with probability |^, i.e., that they agree whenever a color 



10 

other than ci , C2 is chosen. Clearly, the average Hamming distance at the leaves is 

The next ingredient we need is a bound on the distance between two conditional 
distributions at the root in terms of the Hamming distance between the two boundary 
colorings. We establish such a bound for boundary colorings that are highly unbiasing. 
We say that a pair {X, Y) of colorings of the leaves is good if both {X, Y) are highly 
unbiasing. 

Lemma 4.3. For any given £ and any good pair {X, Y) of colorings of the leaves 
Li 

Let us complete the proof of Theorem 14 . 1 1 assuming Lemma W^ For any pair {X, Y) 
of colorings of the leaves Li let i^x y i ^'e the optimal coupling of /x^^ and /Uy^, i.e., 
the probability of disagreement between the pair of colors chosen in this coupling is 
exactly drvil^'x i ^ l^Yil- We then set the coupHng vH ^^ g as follows. First, choose 
a pair (X, Y) of colorings of the leaves from v^ ^ f^] then, choose a pair of colors at 
the root from the coupling Vxvi- Notice that v^/^ g is indeed a coupling of /i^\ 
and [I]} I- We need to bound the probability that the two colors at the root chosen 
according to this coupling disagree. Let {ctc!) be the pair of colors chosen from this 
couplings. We have 

Pr[c 7^ c'] = 5] ^Lc.M^ ^) • '^TviA.i , A.i\ 

X,Y 

good {X, Y) 
bad {X,Y) 

< 2(2A-/2)(i--)^ Y. ^LcM^ ^) • ^^(^' ^) 

X.Y 

+ 2VYxr^^e [X is not highly unbiasing] 
<2(2A"^/2)(i--)^.W(A) + 2e-^"'' 

assuming A > Ao(£). Notice that in the first inequality above we used Lemma 14. 3[ 
in the second we used a simple union bound together with the fact that the event 
of having a highly-unbiasing coloring of the leaves is independent of the color at the 
root, and in the third inequality we applied Lemma 14.21 to bound the first expression 
and Corollary 13.41 to bound the second. This completes the proof of Theorem 14.11 
assuming Lemma 14.31 

4.1. Proof of Lemma 14.31 The proof of the lemma goes along similar lines to 
those in [23], i.e., reducing to the case of a single disagreement using a sequence of 
boundary colorings where one leaf is changed at a time, and then using a coupling 
argument to bound the probability that the disagreement at the leaf percolates all 
the way up to the root. However, our argument differs in two points. The first is 
that here we have to make sure that the intermediate boundary colorings are also 
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valid, and more importantly, that they are highly-unbiasing. The second difference is 
that we use the fact that the colorings are highly unbiasing to get a better bound on 
the probability of a disagreement percolating upwards. (In our setting, if we do not 
assume highly unbiasing colorings at the leaves then in the worst case the disagreement 
may percolate upwards with probability 1.) 

Let {X, Y) be a pair of colorings of the leaves, m — H{X, Y), and (di, . . . , dm) 
be an enumeration of the leaves in which X and Y differ. We construct a sequence 
of colorings X = Zq, Zi, . . . , Zm, ■ ■ ■ , ^2m — Y going from X to y by un-conditioning 
the sites in which X, Y differ one at a time and then adding the condition according 
to Y one site at a time. Formally, for < i < m, Zi{v) = -k ii v = dj for some 
1 !i i ^ ij E^nd otherwise Zi{v) — X{v). For i > m, Zi{v) = -k ii v = dj for some 
1 < j < 2m — i, and otherwise Zi{v) = Y. It is easy to see that indeed for every 
i > 0, {Zi, Zij^i) differ at exactly one leaf. We now show that each of the Zi is a valid 
highly-unbiasing coloring. 

Claim 4.4. Let X,X' he assignments from the leaves Li to {*, l,...,fc} such 
that for all v ^ Li we have X'{v) ^ * implies X'(v) — X{v). If X is a valid unbiasing 
coloring of Li then so is X' . 

Proof. It is clear that if X is a valid coloring then so is X' because X' can be 
extended to X. The unbiasing implication follows by induction on (. from the recursive 
definition of being unbiasing once we establish it for the base case oi (. — 1. Indeed, 
for £ = 1, the number of unused colors in X' is at least the number in X . Thus, if X 
is unbiasing then so is X'. D 

Corollary 4.5. Let X,X' be assignments from the leaves Li to {•, l,...,fc} 
such that for all v G Li we have X'{v) ^ -k implies X'{v) — X{v). If X is a valid 
highly-unbiasing coloring of Li then so is X'. 

Now, by the triangle inequality, the proof of Lemma 14.31 will be concluded once 
we show the following. 

Lemma 4.6. Let Z, Z' be two highly-unbiasing colorings of Li that differ at exactly 
one vertex v d Li. Then, 



T'TV 



[iii{(j{root) I a{L) = Z) , ^ie{a{root) \ a{L) = Z')] < (2A-'^/2)(i-^)^. 



The proof of the above lemma goes by an upward coupling argument similar to that 
used in j23j, that is, we couple the spins along the path from v to the root using a 
recursive coupling similar to that used for the downward coupling. Specifically, given 
the disagreement at v, we first couple the colors at the immediate ancestor u (of v) 
using an optimal coupling that minimizes the probability of disagreement at u. If the 
resulting spins at u agree, then we couple the rest of the path with total agreement. 
If the spins at u disagree then we continue recursively. The variation distance at 
the root is bounded by the probability that the disagreement at the leaf v percolates 
along the path from v all the way to the root. Thus, the important missing ingredient 
is a bound on the probability of disagreement at a vertex u given a disagreement at 
(exactly) one of its children (when the coloring of the leaves is Z) . This can be done 
once we have a bound on the maximum probability of any color at w, as exemplified 
by the following proposition, which is taken from [24 . 

Proposition 4.7. Let ^ he a distribution of proper coloring of some graph ac- 
cording to some boundary condition, and let /i„ he the marginal of fi at u. Consider 
now the distribution resulting from the same setting when adding a neiu neighbor to u 
(connected only to u) and fixing the neighbor's color to c. Let ^'^ he the marginal of 
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the latter distribution at u. Then for any two colors Ci,C2, 

Mu(ci) Mu(c2) 



drv [fJ-u : Mu' ] = max {/^^^ci ) , /x^^ (ca ) } 



1-M«(c2) l-Mu(ci) 



/n particular, if we let p^'^^ = maxc Hu{c) then drvil^u t l^u] — Pu'^^/i^ ~ Pu^^)- 

Proof. We first notice that fi'^ is /i„ conditioned on the resulting color not being c, 
i.e., for every color c' ^ c, A^^(c') = ^„(c')/(l — /z„(c)). Assume without loss of 
generality that /x„(ci) > ^„(c2). It follows that for every color c' ^ ci, n'^[c') < 
pQ{c'). Therefore, dTviA'u^j M?] = M?(ci). Q Going back to our setting of the upward 
coupling, Lemma 14.61 will follow once we show that for highly-unbiasing Z, Z' , for 
every vertex w on the path from v to the root which is at height > e^, given that the 
disagreement percolated to w, the probability that the disagreement percolates to the 
immediate ancestor m of lo is at most 2A~^/^. Notice that once a color at w is fixed, 
the distribution at u is independent of the rest of the coloring of the subtree rooted 
at w, and so that subtree (except for w) can be completely ignored. Thus, we are in 
position to use Proposition 14.71 to bound the probability of disagreement at u given 
a disagreement at w by p™^''/(l — P™'''^), where p^'^^ in our setting is the maximum 
probability of a color at u conditioned on Z at the leaves and when the edge (u, w) is 
removed from Tg (i.e., the subtree rooted at w is removed from Tg). 

The fact that Z is highly unbiasing implies that the conditional probability p^^^ 
can be bounded. 

Lemma 4.8. Let u he a vertex of the tree Ti at distance at least ei from the leaves 
and Z a highly unbiasing coloring of Lg . Let p^^^ be as defined above. Then, 



max ^ \~e/2 



P 



Proof. The proof is by the same arguments as in Lemma 13.21 so we don't repeat 
them, except to point out the following. The probability considered in Lemma 13.21 is 
that of the subtree rooted at u conditioned on Z„, i.e., this subtree is disconnected 
from the rest of Tg. Here, the distribution considered is when u is still connected to 
the rest of Tg, but with the subtree rooted at w removed from the tree. However, it 
is not too difficult to see that if Z is highly unbiasing then the same bound on the 
maximum probability applies to the distribution considered here. 

Consider the orientation of Tg (after removing the subtree rooted at w) so it is 
rooted at u. Since .Z is a highly-unbiasing boundary coloring of Ti (in its original 
orientation) then at least A — 1 of the children of u are unbiasing (those that are 
its children in the original orientation), and hence Z is a highly unbiasing boundary 
condition for the (irregular) newly-oriented tree rooted at u. In the newly oriented 
tree, the distribution considered in Lemma 13.21 does correspond to that considered 
here for p™'''^. D 

By the lemma, for every vertex u at distance at least e£ from the leaves, p'^^^/{l — 
pmax) < 2A-^/2 and hence 



2TV 



[M<^{root) I a{L) - Z) , ne{a{root) \ a{L) = Z')] < (2A-^/2)(i-s)£^ 



completing the proof of Lemma 14.61 and hence the proof of Lemma [ 

4.2. Concentration. We now go on to prove Theorem II. 3( i.e., establish that 
not only does a typical boundary coloring yield a near uniform distribution of colors 
at the root, but that the fraction of boundary colorings having a bias at the root are 
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extremely small. The m.ain additional technical tool that we use here is a reduction 
given in |23| from the tail bound stated in Theorem 1 1.31 to an appropriate tail bound 
in the coupling vK 

Lemma 4.9. For any given i, d > and A > 0, if for every ci,C2 G [k] there 
exists a coupling v of jj.^ ^ and fi^ ^ such that 



Pr(x,Y)~i' 



drvifJ-xj' f^Yj] > ^^ 



<A 



then for every c £ [k], 



Prx. 



'M« 



\^i\^,{c) - l/k\ > 26 



<2(e-V^ + ^ 



Proof. Arguments proving a similar statement to that of this lemma are given in 
Section 5.2 of [53]. For completeness, we repeat the proof in Section [B] D 

As we show next, the coupling v^ constructed above satisfies the hypothesis of 
Lemma 14.91 with the necessary parameters. In fact, the only ingredient missing in 
establishing the necessary tail bound for v^ is a tail bound on the number of disagree- 
ments in the coupling, which in turn follows from the (product) percolation nature of 
the coupling. 

Lemma 4.10. For every £ > ^o(e) and ci,C2 G [k] 



^^(X,Y)- 



Hi{X,Y) > A'^^^ 



< e-"^'""'. 



Proof. Recall from Lemma |4~2] that the average Hamming distance between X, Y 
is < In A. The tail bound can then be established as done in, e.g.. Lemma 5.5 of 
[23]. For completeness, we give another proof in Section [U D 

Now, let us refine the notion of a pair of colorings {X, Y) being good by saying that 
a pair {X, Y) of colorings of the leaves is very good if {X, Y) is good and Hi(X, Y) < 
A^^/^. It follows from Lemma l473l that for any very-good pair {X,Y), 



On the other hand, by Lemma 14.101 and Corollary 13.41 (where the latter is used as in 
the proof of Theorem 14. ip we have that 

r/ s 1 A^f/lO AE*/3 Aef/10 

Pr,^ y.^i [{X, Y) is not very good] < e'^ + 26'^ < 2e-^ 

Thus, t'c c £ satisfies the hypothesis of Lemma HT^ with S = A^''^ and A = 2e~^^ , 
completing the proof of Theorem 11.31 

5. Entropy constant for the block dynamics. In this section we prove the 
rapid mixing result for the block dynamics as stated in Thcorem ll.51 i.e., we establish 
a lower bound on the entropy constant associated with this dynamics when the block 
height is a large enough constant. We rely on results established in [23], where strong 
concentration of the measure as established in Theorem 11.31 above is shown to imply 
the necessary bound on the entropy constant of the block dynamics. We give here the 
high level motivation behind the kind of argument made in [23] and some necessary 
adaptations to our setting, while for the main technical result we simply cite the 
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relevant theorems from [23 . Notice that while the arguments in [53] are written for 
the Ising model, they hold for general models on trees. (We refer to [3T] for the general 
version of these arguments.) 

To see the connection between strong concentration of the measure and the en- 
tropy constant, recall that the dynamics makes heat-bath updates in a random block, 
i.e., erases the local entropy in the updated block. Thus, the amount of entropy erased 
in one step of the dynamics is the average (over blocks) of local entropies, and in order 
to bound the entropy constant it is enough to show that the global entropy of every 
function is not much more than the sum (over blocks) of local entropies of the same 
function. The ability to express the global entropy of every function as the sum of 
local entropies up to a constant factor can be thought of as manifestation of locality 
in the equilibrium measure. Indeed, the main technical result in '231 is that on trees, 
strong concentration of the measure as in Theorem 11.31 is equivalent to the ability to 
express entropy locally as above. 

To make things formal, let / be a function on the space of colorings fl. The local 
entropy of / in a subset A of T„ is E[Ent(/ | (Jt„\a)]- Consider the block dynamics 
A4i and let P„ be the transition matrix of the dynamics when the block to be updated 
is the one rooted at v. Since the dynamics makes heat-bath updates of the block By^i 
then for any function /, Pvf{a) = E(/ | aT„\B^^f), while Pf = jj Y.vgt^ P^f- ^o^' 
for any subset A of r„ we have by definition of entropy that 

Ent(/) = Ent[E(/ | d^)] + E[Ent(/ | a^)]. 

In particular, we have that 

Ent(/) - Ent(P,/) = E[Ent(/ | <tt„\b„,J]' 

i.e., the effect of applying Py is to erase the local entropy of / in By^i. Let us denote 
the sum of local entropies as 



Ei-.^ J2 E[Ent(/|ai.„\s„, 



veT„ 
Since entropy is a convex functional then 

Ent(P/) - Ent ( 1 5^ P./) < ^ E Ent(P„/). 

In particular, 

Ent(/) - Ent(P/) > 1 ^ [Ent(/) - Ent(P„/)] = j^ ■ £i ■ 

Thus, in order to prove the lower bound on the entropy constant stated in Theorem ll.5i 
it is enough to show that for an appropriate choice of the block height £ = £{e) (i.e., 
not depending on n). 

Now, a key technical ingredient in [23] is that strong concentration of the measure 
as established in Theorem II .31 implies (|5.ip . In order to state the formal implication 
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we need a couple of more definitions. For a vertex v ^ T.^, let T„ be the subtree 
rooted at v. For a color c G [fc], let fi^ be the uniform distribution of colorings of T„ 
conditioned on the parent of v being colored c. For v whose distance from the leaves 
is at least i, let Ly^g be the set of vertices at distance i below v. (Equivalently, Ly^g is 
the bottom boundary of By^e.) The following theorem combines Theorems 3.4 and 5.3 
of [53] (or Theorem 5.10 and Lemma 5.18 in PP, where the formulation is closer to 
the one here). 

Theorem 5.1. For some absolute constant a*, the following implication is true 
for every choice of i > 1. If for every vertex v whose distance from the leaves is at 
least £ and for any two colors ci, C2, fJ, = n^ satisfies 



Pr^„ 



/i(cr(u) = C2 I a{Lyj) = TJ Lyj)) ^ 



> 



a*k^{e + iy 



^ ~2a'k''(e+iy 



fi{a{v) = C2) 

(5.2) 
then for every function f > 0, Ent(/) < ASg. We conclude by noticing that from 
Theorem 11.31 (see the following Remark 15.21 for a proviso) we get that (|5.2[) (and 
therefore (|5.ip ) holds for an appropriately chosen i, e.g., £ > ^.^9^ ^,| suffices where 
a* is the absolute constant from Theorem 15.11 and a = a{e) and a' — a'{e) are from 
Theorem 11.31 

Remark 5.2. Notice that Theorem \5. 1\ requires us to prove the strong concen- 
tration of the measure stated in Theorem \1.3\ in a slightly more general scenario. 
Specifically, we need to show similar bounds for a random boundary coloring of By^g 
conditioned on an arbitrary color at the parent of v and when this boundary coloring 
is chosen from the measure on T^ ■ Notice, however, that the distribution of colorings 
at the boundary of By^i under the distribution of colorings of Tn is exactly the same 
as the distribution of colorings of the leaves of the smaller tree in which the vertices 
on the boundary of By^g are the leaves. As for fixing an arbitrary color above v, it 
is straightforward that the concentration stated in Theorem \1.3\ holds in this scenario 
as well. (In this case the distribution at the root is uniform over k — 1 colors so the 
statement of the theorem is modified with -j-^ replacing t .) 

Remark 5.3. In 123] (and {^ \MI), the bound in i5.1\) is used to bound the 
mixing time of the single-site dynamics. This is done by introducing an extra factor 
that expresses the decomposition of the entropy in a block (under an arbitrary boundary 
condition) into the local entropy (or variance) coming from single sites in the block. 
This factor is bounded by the mixing time of the single-site dynamics in the block, 
which in the setting of \23\ \24\ \31}/ is a constant depending on the block size. However, 
in our setting there are valid boundary colorings of the block for which the single-site 
dynamics is disconnected (i.e., infinite mixing time) so we cannot apply the same 
reasoning and suffice with a bound on the mixing time of the block dynamics. 

6. Leftover Proofs. Here we provide the remaining proofs. Most of the argu- 
ments given in this section already appear in the references mentioned in the main 
text, but we give them here for completeness and in order for the proofs to correspond 
exactly to the formulations of the statements made in this paper. 

We start by showing that the form of decay of correlations stated in Theorem l4.1l is 
indeed equivalent to that stated in the main Theorem ll.2l Let ac,i — Ex^^^ [\f^x ei^)~ 
l/k\] and Pcj = llJ-iii^) ~ 1/^1- Notice that Theorem 14. 1 1 establishes that for every c 
and £ I3c,i — A~^(^\ while for Theorem 11.21 we need to show that for every c and £, 
ac,e. = A~^'^'. The following proposition shows that the two statements are indeed 
equivalent. 



16 

Proposition 6.1. For every c and £, ^^ < acj < \/Pc,i- 

Proof. Let gc.ei') be the function on coloring of the leaves Li defined as follows: 

Notice that Ex~^^[gc(-'^)] = 1- Furthermore, Ocj = iEx^^JIffcC^) - 1|]. and Pc,e = 
lExr^^i[\gc{X)\'^ - 1] = iEx~^J|5c(^) - Ip]- The first inequality of the proposition 
then follows from the fact that ||5c||oo < k, while the second inequality follows from 
Cauchy-Schwartz. D 

We now provide the omitted proofs of the two lemmas in Section 14.21 
Proof. [Proof of Lemma l¥!^ First, notice that since iii{(j{Li)) is a convex combi- 
nation of ^^ ^ we can assume w.l.o.g. that for every color c we have a coupling of /i^ ^ 
and /i^ with properties as in the hypothesis of the lemma. Using the same definition 
of gc,i as in Proposition 16. 11 for any a > we then have that 

P^x^^i ^ [dcAX) > 1 + «] < Prx-w [dcAX) > 1 + a - fc<5'] + A. 

On the other hand, by the definition of gd, 

Pi-x.^i^^ [9cAX) > 1 + a] > (1 + a)Prx~M. [9cAX) >! + «]• 

Combining the above two inequalities we get that 

Prx~M. [9c AX) > 1 + a] < ij^A (Prx-^. bcAX) > 1 + a - k5^] + A) . 

In particular, for every non- negative integer to, if we apply the above inequality m-\-\ 
times, increasing a by kS^ each time, we get that 

Pvx^,A9cAX)>l + a + mk6^] < (1 + a)-^"'+'^ + A < e-"('"+i)/(i+«) + - . 

ka a 

Now, by setting a = kS and to — [{l/S)^\ we get that 

Prx^^A9cAX) > 1 + 2kS] < e-*("+i) + | . 

Since the event gcAX) > 1 + 2kS is equivalent to /i^ ^(c) — 1/fc > 2(5, this establishes 
the bound on the positive side of the tail in Lemma l4!9l The negative side of the tail 
is established along similar lines once we notice that from the hypothesis we have for 
any a > 

Prx-M. bcAX) <l-a- k6'^] < Pr^^^i ^ [gcA^) < I - a] + A 

and by definition of gc^g, 

Pr^^^t^[gcAX)<l-a] < (l-a)Prx^^Jg,,,(X)<l-a]. 

D 

Proof. [Proof of Lemma [4.10j Recall that the coupling i/^ is constructed recursively 
such that if the spin at a vertex v agree then all of its children are coupled with 
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agreement with probability 1, while if there is a disagreement at v then its children 
are coupled independently where the probability of disagreement in each of the A 
children is l/(fc — 1). Thus, if we let Di be the random variable counting the number 
of disagreements at level i in this coupling then Dq = 1 and -D^+i is distributed 
as Jim{ADi,j^). We need to establish an upper bound on Pr{D( > A^^'^). Let 
io — e£/9. For every iq < i < £ we say that a failure occurred at level i ii Di > 
(3inA^i-»o . ^io_ Now^ notice that given A, E[A+i] = i^Di < A In A. Thus, by 
a Chernoff bound, for every i > io, 



Pr[ failure in level i + 1 1 not failure in level i] < cxp — ■ I — - — ) A 



to 



Now, since Pr[ failure in level i + 1] < Pr[ failure in level i + 1\ not failure in level i]~\ 
Pr[ failure in level i] and since Pr[ failure in level io] = 0, we have by induction that 

e 
Pr[ failure in level £] < y^ Pr[ failure in level i \ not failure in level i — 1] 

i=io + i 

-A*" ' 



< 2 exp 
16 

< exp (-A=^/i° 

Notice that the event of failure in level i includes the event that Dt > A^^/^ so we 
are done. D 
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